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FOREWORD 



I first presented the lecture "Areas and Logarithms" in the 
autumn of 1951 at Moscow University to a large audience of 
senior schoolchildren intending to participate in the Mathematics 
Olympiad. The aim of the lecture was to present a geometric 
theory of logarithm, in which logarithms are introduced as 
various areas, with all the properties of the logarithms then 
being derived from those of the areas. The lecture also introduced 
the most simple concepts and elements of integral calculus, 
without resort to the concept of a derivative. 

The lecture is published in this booklet with some additions. 
The reader can begin the book without even knowing what a 
logarithm is. He need only have a preliminary knowledge of the 
simplest functions and their graphical representation, of geometric 
progressions, and of the concept of limit. 

If the reader wishes to broaden his knowledge of logarithms 
he is referred to the books The Origin of Logarithms by 
I. B. Abelson and Series by A. I. Markushevich (in Russian). The 
last chapter of the latter book contains an alternative theory of 
logarithms to that presented here. 

The present publication includes a Supplement in which Simpson's 
rule and some of its applications can be found. In particular, 
the number ;t is calculated. 

The author 



1. Suppose a function is given which means that a method is 
indicated which allows us to associate every value of x with a 
corresponding value of y (the value of the function). Usually 
functions are defined by formulas. For instance, the formula y = x 2 
defines y as a function of x. Here, for every number x (say, 
x = 3) the corresponding value of y is obtained by squaring the 
number x (y = 9). The formula y = 1/x defines another function. 
Here for every nonzero x the corresponding value of y is the 
number inverse to x; if x = 2 then y = 1/2 and if x = — 1/2 
then y = — 2. 

When we speak of a function without indicating what particular 
function is meant we write y =f(x) (read "y is f of x"). This 
means that y is a function of x (perhaps y = x 2 , or y = 1/x, 
or some other function). Recall the idea of number lettering: 
the method just described closely resembles it, for we can speak 
either of the numbers 2, - 1/2, J/2 or of a number a, 
understanding it as one of these or any other number. Just as 
we use different letters to designate numbers, so we can describe a 
function as y=/(x), or use some other notation, for instance 
y = g(x), or y = h(x), etc. Thus, if a problem involves two 
functions, we can denote one of them as y =/(x) and the other 
as y — 9 ( x \ an d so on. 

The function y =/(x) can be shown graphically. To do this we 
take two mutually perpendicular straight lines Ox and Oy — the 
coordinate axes (see Fig. 1) — and, after choosing the scale unit, 
mark off the values of x on the x-axis and the corresponding 
values of y =/(x) on the lines perpendicular to Ox (in the 
xOy plane). In so doing the rule of signs must be adhered to: 
positive numbers are denoted by line segments marked off to the 
right (along the x-axis) or upwards (from the x-axis) and negative 
numbers are marked off to the left or downwards. Note that 
the segments marked off from the point O along the x-axis are 
called abscissas and the segments marked off from Ox at right angles 
to it are called ordinates. 

When the construction just described is carried out for all 
possible values of x, the ends of the ordinates will describe a 
curve in the plane which is the graph of the function y =/(x) 
(in the case of y = x 2 the graph will be a parabola; it is shown in 
Fig. 2). 

Take any two points A and B on the graph (Fig. 1) and drop 



from them perpendiculars AC and BD to the x-axis. We obtain a 
figure ACDB; such a figure is called a curvilinear trapezoid. 
If, in a special case, the arc AB is a line segment not 
parallel to Ox, then an ordinary right-angled trapezoid is obtained. 
And if AB is a line segment parallel to Ox, then the resulting 
figure is a rectangle. 

Thus, a right-angled trapezoid and a rectangle are special 
cases of a curvilinear trapezoid. 




Fig. 1 



Fig. 2 



The graph of the function depicted in Fig. 1 is located above 
the x-axis. Such a location is possible only when the values of the 
function are positive numbers. 

In the case of negative values of the function the graph is 
located below the x-axis (Fig. 3). We then agree to assign a 
minus sign to the area of the curvilinear trapezoid and to 
consider it as negative. 

Finally it is possible for the function to have different signs 
for the different intervals of the variation of x. Its graph is then 
located partly above Ox and partly below it; such a case is shown 
in Fig. 4. Here the area A' CUB' of the curvilinear trapezoid 
must be considered negative and the area A"C"D"B" positive. 
If in this case we take points A and B on the graph, as shown 
in the figure, and drop perpendiculars AC and BD from them to the 
x-axis, we then obtain a figure between these perpendiculars which is 
hatched in Fig. 4. The figure is called a curvilinear trapezoid, 
as before; it is bounded by the arc AKA'B'LA"B"B, two 
ordinates AC and BD and a segment CD of the abscissa axis. 
We take as its area the sum of the areas of the figures ACK, 
KA'B'Land LA'B'BD, the areas of the first and the third of them 
being positive and the area of the second negative. 

The reader will readily understand that under these conditions 
the area of the whole curvilinear trapezoid ACDB may turn out 



to be either positive or negative, or in some cases equal to 
zero. For instance, the graph of the function 

y = ax (a > 0) 

is a straight line; here the area of the figure ACDB (Fig. 5) 
is positive for OD > OC, negative for OD < OC, and equal to 
zero in the case of OD = OC. 

y 




Fig. 3 



Fig. 4 



2. Let us determine the area S of a curvilinear trapezoid. 
The need to calculate areas arises so often in various problems of 
mathematics, physics and mechanics that there exists a special 
science — integral calculus — devoted to methods of solving problems 
of this kind. We shall begin with a general outline of the 
solution of the problem, dividing the solution into two parts. 
In the first part we shall seek approximate values of the area, 
trying to make the error in the approximation infinitely small; 
in the second part we shall pass from the approximate values 
of the area to the exact value. 

First let us replace the curvilinear trapezoid ACDB by a stepped 
figure of the type shown in Fig. 6 (the figure is hatched). It is 
easy to calculate the area of the stepped figure: it is equal to 
the sum of the areas of the rectangles. This sum will be 
considered as being the approximate value of the sought area S. 

When replacing S by the area of the stepped figure we make 
an error a; the error is made up of the areas of the curvi- 
linear triangles blacked-out in Fig. 6. To estimate the error let us 
choose the widest rectangle and extend it so that its altitude 
becomes equal to the greatest value of the function (equal to 
BD in the case of Fig. 6). Next let us move all the curvi- 
linear triangles parallel to the x-axis so that they fit into that 
rectangle; they will form a toothed figure resembling the edge of a saw 
(Fig. 7). Since the whole figure fits into the rectangle, the error a 



equal to the sum of the areas of the saw teeth * must be less than 
the area of the rectangle. If its base, is 8 we obtain | a | < 8 • BD. 
Hence, the error a can be made infinitesimal if the rectangles in 
Fig. 6 are taken to be so narrow that the base 8 of the widest 
of them is a sufficiently small number. For example, if BD = 20, 
and we want the area of the stepped figure to differ from S by 
less than 0.001, we must assume 8BZ) = 208 to be less than 





Fig. 5 



Fig. 6 



0.001, i.e. 5 < 0.00005. Then 

|a|<8B£>< 0.001. 

Nevertheless, however small we make 8, an error a will result 
every time, if only a very small one, since the area of the 
curvilinear trapezoid does not equal that of the stepped figure. 

The second, and final, part of the solution of the problem consists 
in passing to the iimit. We assume that we consider not one, 
and not two, but an infinite number of stepped figures such as 
the one shown in Fig. 6. We take more and more rectangles, 
increasing their number indefinitely, making the base 8 of the 
widest rectangle smaller and smaller, so that it tends to zero. 
The error ot resulting from the replacement of the area of the 
curvilinear trapezoid by the area of the stepped figure will 

In Figs. 6 and 7 the graph of the function is like the slope up (or 
down) a hill. Were the graph more complicated, with alternating rises 
and descents (see, for instance, Fig. 4), then the curvilinear triangles 
transferred into a single rectangle would overlap and the sum of their areas 
could turn out to be larger than the area of the rectangle. To make our 
reasoning applicable to this more complex case as well, let us divide the 
figure into parts to make the graph, within the limits of a single part, 
look like a single rise, or a single descent, and consider each part separately. 



then become increasingly small, and indefinitely approach zero 
as well. The required area S will be obtained as the limit of the 
areas of the stepped figures. 

3. Let us follow the reasoning used in Section 2 to estimate the 
area of the curvilinear trapezoid in the very important special 
case when the function y =f(x) is a power with an integral 
nonnegative exponent y = x*. For the exponents k = 0, 1, 2, we 
obtain the functions y = x° = 1, y = x 1 = x, y = x 2 . Their graphs 




Fig. 7 



are easy to construct: they are, respectively, a straight line 
parallel to the x-axis and passing above Ox at a unit distance 
(Fig. 8), a bisector of the angle xOy (Fig. 9), and a parabola 
(Fig. 10). 

If we use higher exponents, we obtain the functions y = x 3 , 
y = x 4 , y = x 5 . These graphs are shown in Figs. 11, 12 and 13. 

If k is an odd number, the graphs are symmetrical with 
respect to the point (Figs. 9, 11, 13), and if k is an even 
number, then they are symmetrical with respect to the y-axis 
(Figs. 8, 10, 12). 

If k > 1, the graphs pass through the point O. In this case the 
greater the value of k, the closer the graphs are to the x-axis 
in the proximity of the point and the steeper they rise 
upwards (or fall downwards) as they recede from the point 0. 

Each of the figures 8-13 contains a hatched curvilinear trapezoid. 
The areas of these trapezoids are easy to find when k = 
and k = 1. Indeed, if k = 0, the area of A CDB is equal to CD AC = 
= (b - a)- 1 = b - a; if k = 1, then the area of ACDB is 



„_ AC + BD a + b 

CD =(6_fl)_ -_ 



b 2 -a 2 



10 



y 



f»-Q) A 8 






Fig. II 




Fig. 12 
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Let us prove that if k = 2, the area of ACDB is equal to 

b 3 - a 3 fc 4 - a 4 

— - ; ;." k = 3, the area of /ICDB is - — -— , etc. We shall 

3 4 

prove that in the general case, when k is any integral nonnegative 
number, the area of the corresponding curvilinear trapezoid is 

b k+i _ a k + i 
equal to — . It is evident that this general result 

covers all the special cases discussed above. 




Fig. 13 

To make it easier to follow the calculations below let us assume 
a definite numerical value for the exponent k, say k = 5. Let 
us further suppose that < a < b. Consequently we refer to the 
graph of the function y = x 5 and, following the reasoning 
outlined in Section 2, prove that the area of the curvilinear 

b 6 -a 6 

trapezoid ACDB (Fig. 14) is equal to . 

6 

4. We have to calculate the sum of the areas of the very 

large number of rectangles which make up the stepped figure 

(Fig. 14). To simplify our job we choose the rectangles so 

that their areas form a geometric progression. To do this 

we take the points E, F, G, H, . . . , / on the x-axis so as to 

make the line segments OC = a, OE, OF, OG, ..., 01, OD = b 

form a geometric progression: we designate the number of terms in 

this progression by n + 1 and its common ratio by q (since 

12 



b > a, q > 1). Then we have the equalities 

OC = a, 0E = aq, OF = aq 2 , 
OG=aq 3 , ..., Ol^aq"- 1 , OD ±= aq" = b. 

Fig. 14 depicts six rectangles and hence n + 1 = 7, but we 
shall suppose in what follows that n is an arbitrarily large number, 
say n = 1000, 10000, 100000, etc. 




Fig. 14 



The bases of the rectangles form a geometric progression with 
the same common ratio q: 

CE = OE-OC = a{q - 1), EF = OF - OE = aq(q - 1), 

FG = OG-OF = aq 2 (q-l), ..., 

ID = OD-OI = aq"- 1 (q- 1) 

13 



(the number of terms in this and the following progressions 
is equal to n and not to n + 1). 

The altitudes of the rectangles are the ordinates CA, EE U 
FFj, GG X , ..., II x; each of them is equal to the fifth power of the 
abscissa corresponding to it (remember that we have assumed 
y = x 5 ). Hence 

CA = OC 5 = a 5 , EEi = OE 5 = a 5 q 5 , FF 1 = OF 5 = a s q 10 , 

GGi =a 5 q i$ , ...,//, =0/ 5 =aV , ""' ) - 

We see that the altitudes of the rectangles also form a geometric 
progression with the common ratio q 5 ( = <j*). 

Since the bases of the rectangles form a progression with 
the common ratio q and the altitudes form a progression with the 
common ratio q 5 (= q k ), the areas of the rectangles must form a 
progression with the common ratio qq s = q 6 (=q k+1 ): 

CE-CA =a(q- l)a 5 =a 6 (q- 1); 
EFEEi = aq{q-\)a 5 q s =a 6 q 6 {q- 1); 
FGFF, = aq 2 (q - l)a 5 q 10 = a 6 q i2 (q ~ 1); 



ID ■ //, = aq n - ' (q - l)a 5 q Sitt - " = a 6 q 6( "~ "fa - !)• 

Therefore the sum of the areas of the rectangles equal to the 
area of the stepped figure is the sum of the geometric progression 
with the first term a b (q- 1), the last term a t q H "~ l) {q- 1) and 
the common ratio q*: 

a *q^ ( q- l)q <-a<(q-l)^ (aqr _ a6iq -l _ 



q 6 - 1 LV -" ' J q 6 - 1 



_ U 



q s + q* + q 3 + q 2 + q + 1 

q 6 -l 
we have used the equalities b = aq" and — = q 5 + q* + 



+ q+q*+q+l\. 

5. Let us increase indefinitely the number of the rectangles n. 
Since the bases of the rectangles form an increasing geometric 
progression (q > 1), the first one must be the smallest. But the sum 
of the lengths of all the n bases is equal to b - a; therefore 

14 



the length of CE is less than — , i. e. aq — a < , whence 

u n n 

. b-a 
q-\< 



na 

The right-hand side of the last inequality tends to zero when n 
increases indefinitely; since the left-hand side is positive it must 
tend to zero too, that is, q tends to unity. 

This implies that q 2 , q 3 , q* and q 5 also tend to unity, 
the sum q 5 + q 4 + q 3 + q 2 + q + 1 tends to 1 + 1 + 1 + 1 + 1 + 1 = 
= 6 and, hence, the whole area of the stepped figure, equal to 

b« - a 6 



tends to the limit 



q s + q* + q 3 + q 2 + q + 1 ' 
b 6 -a 6 



The required area of the curvilinear trapezoid must equal that 
particular limit: 

_ _ b 6 - a 6 
S ~ ~ 6~~- 

We have obtained this result for k = 5. If we were to make these 
calculations in the general case for any natural k, we would 
obtain 

b k+1 -a k+l 



S = 



fc+1 



Thus we have proved that the area of the curvilinear trapezoid 
bounded above by the arc of the graph of the function 
y = x* and located between two ordinates with the abscissas a and b 

is equal to 



k+l 

6. We obtained the results of the previous section assuming 
that < a < b, i. e. that the curvilinear trapezoid lies to the right 
of the y-axis. If a < b < 0, the proof is carried out in the same way. 
However, assuming the common ratio q of the progression to be 
positive and greater than unity as before, we must now take b as 
the first and a as the last term of the progression (since 
\a\ > \b\). Repeating the computations we receive the same result: 

c _ b k+l -a k+i 

S ~ k+l 

15 



If k is an odd number, then k + 1 is even and, hence, b k+ 1 
and a k+i are positive numbers, the first number being less than the 
second. Therefore, in this case S is expressed by a negative 
number. This is clearly to be expected, since for odd k the 
corresponding curvilinear trapezoid lies below the x-axis (see the 
left-hand parts of Figs. 11 and 13). 

Let us return to the case < a < b. If we consider b to be 
invariable and make a tend to zero, then the -curvilinear 
trapezoid will extend to the left, and for a equal to zero 
will turn into a curvilinear triangle OBD (Fig. 15) (we assume 
that k > 1). It is obvious that for a tending to zero the area of the 




Fig. 15 



curvilinear trapezoid will tend to the area of the curvilinear 
triangle. Indeed, the difference between the second and the 
first area will be less than the area OCAL, which itself tends to zero. 
On the other hand, for a tending to zero, the area of the 

curvilinear trapezoid tends to -; — , as is seen from the obtained 

k + 1 
formula. Therefore the area of the curvilinear triangle ODB is 

b k+1 



equal to 



k+ 1 



i. e. it is k + 1 times less than the area of the 



rectangle ODBK, or, the same thing, k + 1 times less than the 
product of the "legs of the right-angled triangle" ODB (we have 
put inverted commas since we speak here not of the ordinary 
triangle, but of the curvilinear one). For k = 1 we have a function 
y = x, with the graph becoming a straight line (see Fig. 9) 
and the triangle an ordinary right triangle with its area equalling 

. = -y of the product of the legs. 



16 



We obtain analogous results if we proceed from the assumption 
a < b < (the curvilinear trapezoid is located to the left of Oy). 
Taking a to be invariable we make b tend to zero; then the expression 



-a" 



tends to the limit — 



-. This will be exactly the 



k+ 1 k+ 1 

area of the corresponding curvilinear triangle. 

A curvilinear triangle can be considered as a special case of a 
curvilinear trapezoid. From .what we have established it follows 
that the formula 

remains valid for a curvilinear triangle as well. We need only put 
in it a = (if the triangle lies to the right of Oy) or 
b = (if the triangle lies to the left of Oy). 

7. Let us return to the general problem of the areas of curvilinear 
trapezoids. Let ACDB be a curvilinear trapezoid bounded by the arc 




Fig. 16 



AB of the graph of the function y =f(x), two perpendiculars AC 
and BD dropped from the end points of the arc to the 
x-axis, and by the segment CD of the line Ox intercepted 
by the feet of the perpendiculars (Fig. 16). If OC = a and OD = b 
with a < b, then the area ACDB is designated 



\f{x)dx. 



W 



Every detail in this designation has a definite meaning. Here we have 
a function f(x) whose graph is the upper part of the boundary 
of the curvilinear trapezoid, and also the numbers a and b, 



2-QU.-,2 
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specifying the right part and the left part of the boundary. 
The designation (*) defines the method of seeking the area of 
ACDB; this method was given in Sections 2 and 3 and involves 
calculating the sum of the areas of the rectangles constituting 
the stepped figure and passing to the limit. The sign J is 
the extended letter S, the initial letter of the Latin word summa, 
sum. The unusual shape of the letter S implies that the calculation 
of the area of the curvilinear trapezoid involves not only summation, 
but must also include the passage to the limit. To the right 
of the sign j, which is called the integral sign (from the 
Latin integer meaning full, entire), is the product f(x)dx. It represents 
the area of the rectangle with altitude f(x) and base dx. The 
letter d standing before x is the initial letter of the Latin word 
differentia meaning difference; dx denotes the difference between 
the two vaiues of x (see Fig. 16): dx = x' — x. The number a 
is the lower limit and b the upper limit of the integral (here 
the word "limit" means "boundary"). 

Thus, the designation (*) for the area of the curvilinear 
trapezoid carries, on the one hand, all the information concerning 
its form and dimensions (given by the numbers a and b and by 
the function f(x)\ and, on the other hand, contains within it the 
method of seeking the area of the trapezoid by calculating 
the areas of the rectangles with altitudes f{x) and bases dx, by 
summing these areas and by passing to the limit (the integral 
sign shows that the summation and passing to the limit must be 
carried out). The designation (*) should be read as "the integral 
from a to b, f of x dx". We repeat once again that this 
designation expresses the area of the curvilinear trapezoid ACDB. 
Using the new designation we may express the results of Section 5 
as follows: 

* b* +1 -a* +1 

\x k dx = 



fc+1 



(k is an integral nonnegative number). The last equation should 
be read as "the integral from a to b of x* dx is equal to the 
difference b k+l and a* +1 , divided by k + 1". 

8. Let us define some simple properties of integrals. Obvious- 
ly the area ACDB added to the area BDD'B' yields the 
area ACD'B' (Fig. 17). But the first area is equal to 

b c c 

[f(x)dx, the second to \f(x)dx, and the third to f,/(x)dx; 



therefore we have \f{x)dx + \f(x)dx = \f{x)dx. Here a<b<c; 

a b a 

now if a < c < b (Fig. 18), then, taking into account that the areas 
ACD'B' and B'D'DB together yield the area ACDB, we obtain 

lf(x)dx + \f(x)dx = \f(x)dx. 



In introducing the concept of the integral $f(x)dx in Section 7 

a 

we assumed that a < b, i. e. that the lower limit is less than the 





Fig. 17 Fig. 18 

upper limit. That is why the area BDD'B', where OD = b 

c 

and OD' = c for b< c (Fig. 17), was written as ^f(x)dx 

b 
b 

and for b > c (Fig. 18) as \f(x)dx (every time the lower limit 
is less than the upper limit). In the first case the difference 

c b c 

between the integrals \f(x)dx — \f(x)dx was equal to $f(x)dx, and 



in the second case to \f(x)dx (we have made use of the equalities 

c 

of the integrals written above). To be able to express both cases 
by the same formula we shall agree that for b > c we may write 

\f(x)dx=-]f(x)dx. 

b c 

19 



In other words, we shall now include an integral whose lower 
limit is greater than the upper limit, meaning by this the area of a 
curvilinear trapezoid taken with the opposite sign. Then, instead 
of the two different formulas 

jf(x) dx - ]f(x) dx = \f(x) dx (b < c) 



and 

\f(x)dx - \f(x)dx = - \f(x)dx (b > c) 



we shall write in all cases 

\f(x)dx - ]f(x)dx = ]f(x)dx (*>#c). 

a a. b 

For b = c the left-hand side vanishes; therefore it is legitimate 

b 

to consider the integral \f{x)dx, assuming it to be equal to zero. 

b 

Thus, irrespective of whether b < c, b> c or b = c, we can 
use the formula 

jf(x)dx-\f(x)dx = {f(x)dx 

a a b 

in all cases. This formula can also be written as follows: 
\f(x)dx + \f(x)dx = \f(x)dx. 



We leave it to the reader to verify, with the results established 
in this Section, that the formula 

^ b k + i_ a t + i 



$x k dx = 



fe+1 



is valid for any a and b (and not only for < a < b or 
a < b < 0). 

9. Assume that f(x) is written as the sum or the difference of 
two functions :f(x) = g(x) + h(x) or/(x) = g(x) — h(x) (for example, 
f(x) = x 3 — x 5 ). Then the integral of f(x) can be replaced by the 
sum or the difference of the integrals of the functions g(x) and 

20 



h(x): 

\f(x)dx = \g(x)dx + 'jh(x)dx 



b b b \ 

or \f(x)dx = [g(x)dx - \h{x)dx I. For example, 

a " a / 

" " b b* -a* b b -a 6 

j(x 3 - x 5 )dx = ]x 3 dx - ]x s dx = - a - ^—^- 



Let us prove this property of integrals, taking the case of the sum. 
Let f(x) =g(x) + h(x); the graphs of the three" functions g(x), h(x) 
and f(x) are depicted in Fig. 19. We have to prove that 

\f(x)dx = \g(x)dx + 'jh(x)dx, 

a a a 

that is, that the area of A CDB is equal to the sum of the areas 
of AiCiD^Bi and A^CiP^B-^. Let us divide the segment of the 
x-axis between the points x - a and x = b into parts and construct 
the corresponding stepped figures for the three curvilinear 
trapezoids depicted in Fig. 19. It is evident that the area of each 
rectangle in the lower part of the figure is equal to the sum of the 
areas of the two rectangles shown in the two upper parts of the 
figure. Therefore the area of the lower stepped figure is equal 
to the sum of the areas of the two stepped figures lying 
above. This connection between the areas of the stepped figures 
will remain no matter how we divide the interval on the x-axis 
between x = a and x = b. If this interval is divided into an 
indefinitely increasing number of parts whose lengths tend to zero, 

b 

then the lower area will tend to the limit \f{x)dx and 

a 

the two parts lying above will tend to the limits 

6 6 

\g{x)dx and \h(x)dx. 



Since the limit of the sum is equal to the sums of the limits, 
\f(*)dx = i0(x) + h(x)]dx = \g(x)dx + ]h(x)dx; 

a a a a 

and that is what we had to prove. 

21 



In just the same way it can be proved that 

][g(x) - h(x)\dx = ]g(x)dx - ]h(x)dx. 

a a a 

It is easy to see that this property of integrals is also valid 
when f(x) is the sum of the greater number of summands. For 



y 


A 


c, 




(*. 


) 




B, 







3 










b '* 




Fig. 19 



example, if f(x) = g(x)- h(x) + k(x), then 
][g(x)-h(x) + k(xj]dx = 

a 

= llg(x)-h(x)]dx+\k(x)dx = 

» a 

= ]g{x)dx - \h(x)dx + jk(x)dx. 

a a a 
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10. We have also to find the connection between the integrals 
]f{x)dx and \Cf(x)dx, 

a a 

where C is some (constant) number; for instance, we have to find 

b b 

the connection between the integrals jx 3 dx and \2x 3 dx. Let 

a a 

us show that 

\cf(x)dx = c]f(x)dx, 

a a 

for example, 

* * h A - a* h* -a 4 

\2xUx = 2 $x 3 dx = 2 ^— — = b a . 

a a 4 2 

To simplify the reasoning we shall give to C a definite numerical 
value, say, C = — . Now we have to compare the integrals 



b 

\f(x)dx and 



b 



jf(x)dx. 



Fig. 20 depicts the curvilinear trapezoids whose areas are represented 
by these integrals. Let us divide the segment of the x-axis 
between the points x = a and x = b into a number of parts and 
construct the corresponding stepped figures. It is easy to see that 
the area of each rectangle in the lower part of the figure is equal 
to half the area of the rectangle lying above it (from the upper 
part of the figure). Therefore the area of the lower stepped 
figure is half that of the upper stepped figure. Passing to the 
limit as we did in Section 9, we conclude that the whole area 
of the lower curvilinear trapezoid is also half the area of the upper 
curvilinear trapezoid: 

b 

±f{x)dx=\]f(x)dx. 



b 

J" 



In our argument the number C was positive; if we assume C 
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to be negative, for instance, C= -~~, then we have to replace 

Fig. 20 by Fig. 21. 

Comparing now the area of ACDB with that of A"C"D"B' 
we find that here, besides the variation in the absolute value of the 



y-f(x) 




area (its decrease by half), the sign changes too. Hence, 



j(-4yw^=-i?/w^- 



It is clear that we assumed C = ± — only for clarity. In general, 
for any C, the equality 



\Cf(x)dx = Clf{x)dx 



is valid. 

To show how the properties of integrals discussed in this 
and in the previous section can be used, let us compute the integral 

24 



](3x 2 -2x'+l)dx. We obtain 



j(3x 2 - 2x + \)dx = ]3x 2 dx - \2xdx + )\dx = 



= 3}x 2 dx-2]xdx + ]x°dx = 



, l 3 -0 3 . l 2 -0 2 l'-O 1 ,1 ,1 , , 

= 3— 2 ^- + - 1 - = 3 T -2 T+ l = [. 




Fig. 21 



11. Let us consider the function 

-, 1 

y = x = — . 
x 

Its graph is referred to as an equilateral hyperbola; it is shown 
in Fig. 22. If the formula for the area of the curvilinear 
trapezoid 

derived earlier for the case fc > 0, is applied to this case, 
then, noting that k + 1 = 0, b k+ 1 = a* + ' = 1, we obtain in the right- 
hand term the meaningless expression — . Hence our formula 

cannot be used in the case of k = 1. 
Although the formula cannot be employed for the calculation 

b 

of the integral jx~ l dx, we can nevertheless study some properties 

a 

of this integral. 
We can prove if a and b are increased or decreased 
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the same number of times, i. e. are multiplied by the same q > 0, a 
new curvilinear trapezoid with the same area results. Needless to say 
we prove this property assuming that the curve whose arcs bound the 
curvilinear trapezoids from one side is an equilateral hyperbola and 




Fig. 22 



not some other curve. In other words, 

bq b 

f. x~ 1 dx = |x"'</x, 



for any q (q > 0). 

To make it easier to follow the proof let us give to q a 
definite numerical value, for example q = 3. Fig. 23 depicts two 
curvilinear trapezoids ACDB and A"C'D'B" corresponding to this case. 
The first trapezoid is narrower but higher, the second wider but 
lower. Let us prove that the increase in width in the second 
case is compensated for by the decrease in height, so that the 
area remains the same. To carry out the proof let us divide the first 
trapezoid into several narrower trapezoids and replace each of the 
latter trapezoids by a rectangle (Fig. 23). If the abscissa of each 
point of the stepped figure ACDB constructed above is trebled and 
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the ordinates are left unchanged, a figure A CUB' results whose area is 
three times as large, since each rectangle becomes three times as 
wide. But the ends of the ordinates are no longer located 
on our hyperbola. Indeed, this hyperbola is the graph of the inverse 

proportion y = — , and to keep the points on the hyperbola the 

ordinate must be decreased by the same number of times as 
the abscissa is increased. If all the ordinates of the figure 
A'C'D'B' are decreased three times the resulting figure is 
A"C'D'B", which is a curvilinear trapezoid bounded above by the arc 

of the hyperbola y = — and from the sides by the ordinates 




Fig. 23 

constructed for x = 3a and x = 3b. The resulting rectangles have 
bases three times as large as the original rectangles and altitudes 
three times as small. Hence their areas are the same as those of the 
original rectangles. Consequently, the areas of the two stepped 
figures are identical, as are their limits, i. e. the areas of the 
curvilinear trapezoids: 

qb b 

$x- 1 dx = $x- i dx. 

qa a 



We have proved this property assuming that a < b. But it is also 
true for a = b and a > b. Indeed, if a = b, then aq = bq and both 
integrals vanish, and hence the equality is not violated. If, 
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however, a > b, then aq > bq; in this case we have the equality 

bq ft 

(now b < a and therefore b and a exchange the roles they play). 

ft a 

But we agreed in Section 8 that \f{x)dx means -\f{x)dx for 

a b 

a> b. Consequently 



']x- l dx=-"Jx- i dx, \x- i dx=-]x~ i dx. 

aq bq & 6 

Since the right-hand sides of these relations are equal, the 
left-hand sides should be equal too: 

bq b 

$x~ i dx = lx- 1 dx. 

aq a 

Thus, the relation we have proved remains valid irrespective of 
whether a < b, a = b or a > b. 

b 

12. Now let us assume a = \ and consider \x~ l dx. If 

i 

b > 1, the integral represents the area of the curvilinear trapezoid 

ACDB (Fig. 24). Now if b = 1, it turns into zero, and, finally, 

if < b < 1, then the lower limit of the integral is less than the 

upper limit and we obtain 

\x' l dx= -\x- x dx. 

i ft 

This means that in this case the integral differs from the area of the 
curvilinear trapezoid B'D'CA only in its sign (Fig. 25). In any 

ft 
case, for every positive value of b the integral \x~*dx has 

i 
quite a definite value. It is positive when b > 1, equal to zero 

when b = 1 and negative for b < 1. 

ft 
It is evident that the integral jx"'dx is a function of b. 

i 
This function plays a very important role in mathematics; it is 
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called a natural logarithm of the number fo and is designated In fo. 
Here / and n are the initial letters of the Latin words 
logarithmus — logarithm, and naturalis — natural. Thus, 

\x~ l dx = \nb. 

i 

Let us discuss some properties of natural logarithms. First of 
all we have 

lnfo>0 if b> 1; In 1 = 0; Info < if b < 1. 

Next we derive the main property of a logarithm, the fact that 
the logarithm of a product is equal to the sum of the logarithms 
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of the factors, for instance, In 6 = In 2 + In 3. In the general case 
In (foe) = Info + lnc, 

that is 

be b c 

J x~ l dx = J x~ i dx + j x~ i dx. 
i i i 

Indeed, according to what was proved above 

c qc 

§x~ l dx = \x' l dx 
i i 

for any q > 0. Let us take q = fo; then we shall have 

\x~ i dx= jx-'dx. 
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Therefore 

i 1 i f> 

But the last sum can be replaced, according to the property derived 

be 

in Section 8, by the integral $x~ i dx. Hence we have 



]x~ l dx + ]x~ l dx= \x' l dx. 



just what we had to prove. 

This property allows us to derive some corollaries. Let b > 0; 
then, according to the property proved above, 



•ni=m(*j) = 



lnft + In—; 
b 



since In 1 =0, In ft + In— = 0, whence 
b 

In-Jr— -In ft. 

b 

For example, In— = — In 2. Next we have 

ln-£- = ln(c-7- J = lnc + ln-r- = lnc- lnft 

if ft > and c> 0; in other words, a logarithm of a quotient 
is equal to the difference of the logarithms of the divident and the 
divisor. 

The main property of the logarithm was formulated for the product 
of two factors, but it is valid for the product of any number of 
the factors as well. Thus, for instance, if we have three factors, 
then we obtain 

ln(ftc<i) = ln[(ftc)<] = ln(ftc) + ln<* = 

= (lnft + lnc) + ln<i = Inft + lnc + In d. 

Evidently, whatever the number of the factors, the logarithm of 
their product is always equal to the sum of the logarithms of the 
factors. 
Let us apply this property to the logarithm of a power with the 
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integral positive exponent k. We have 

lnfc* = ln(bb ... b) = lnb + lnfr + ... + lnb = klnb. 

k times A: times 

For example, In 16 = In 2* = 4 In 2. 
Let c = yb; then c* = b and consequently 

lnfr = lnc* = klnc = fclnVk, 
whence we have 

In 1/5 —4 Ink 

v k 

For instance 

3 r 1 
lnp/2 = yln2. 

p 
If c = b i , where p and q are positive integers, then, by the 
properties proved above, 

InM =ln^ = — In 6* = — p\ab = ^-\ab. 

Y ■ q q q 

Hence, the property 

lnb k = klnb 
is valid not only when k is a positive integer, but also when k is a 

fraction in the form — . 

q 

It is easy to see that the same property is valid for a negative 
k as well (integral and fractional). Indeed, if k < 0, then — k > 
and we have 

ln6* = ln-r^r= -lnb~ k = - (- klnb) = klnb. 

Finally, the same property is valid for k = too: 
lnfr° = lnl=0 = 0-lnft. 

Thus, for any rational k (positive, equal to zero or negative, 
integral or fractional) it is true that 

lnb k = k\nb. 
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It would also be possible to prove that this relation is true for 
an irrational k; for example, 

lnfc^ = j/21n6. 

We shall accept the latter statement without proof and shall 
use the following property: the natural logarithm of a power is equal 
to the exponent multiplied by the natural logarithm of the base of the 
power, for all possible values of the exponent k, both rational 
and irrational. 

13. Let us compute some logarithms. Perhaps we should calculate 
In 2, i.e. find the area of the curvilinear trapezoid ACDB depicted 
in Fig. 26a. We divide the line segment CD into ten equal parts 
and draw the respective ordinates: KiLi, X 2 L 2 , ..., K 9 L g . To find 




Fig. 26 



the best approximation to In 2, we replace each of the ten resulting 
narrow curvilinear trapezoids not by a rectangle as we did before, but 
by an ordinary, i. e. rectilinear trapezoid. To do this, we connect 
point A with L u point L x with L 2 , and so on, ..., L 9 with B 
by line segments. In Fig. 26a it is difficult to distinguish between 
ordinary and curvilinear trapezoids; to make the distinction more 
visible we increase Fig. 26b in scale. The area of each trapezoid 
is equal to the product of half the sum of the bases by the 
altitude; but in our case all the altitudes are equal: 

CK X =K 1 K 2 = ... = K 9 D = 0.1. 
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Therefore, the areas of the trapezoids will be as follows: 
AC+K.L, Q1 K 1 L l + K 2 L 2 Q] . K 9 L g + BD Q) 

the sum of these areas is equal to 

j (^C + A: 1 L 1 ) + (/C 1 L 1 + X 2 L 2 ) +_. . . jJ^Lpj- BD) 

2 ' """ ' 

or to 

0.1(0.5/4C + K,L, + K 2 L 2 + ... + K 9 L 9 + 0.5 BD). 

It remains only to draw the reader's attention to the fact that 
all the bases of the trapezoids are the ordinates of the graph 

of the function y = — corresponding to the following abscissas : 

1; 1.1; 1.2; 1.3; 1.4; 1.5; 1.6; 1.7; 1.8; 1.9; 2. 
Therefore 

AC = -J- = 1.000; K,L, =~ = 0.909; K 2 L 2 = Ty = 0.833; 

K 3 L 3 = ~ = 0.769; K 4 L 4 = -^ = 0.714; K 5 L 5 = j- = 0.667; 

K 6 L 6 =j£ = 0-625; K 7 L 7 = -jy = 0.588; K 8 L 8 = ---'- - 0.556; 



iV = 0.526; BD = T 



K 9 L 9 = T^ = 0.526; BD = — = 0.500. 



Consequently, the sum of the areas of the trapezoids is equal to 

0.1 (0.500 + 0.909 + 0.833 + 0.769 + 0.714 + 0.667 + 

+ 0.625 + 0.588 + 0.556 + 0.526 + 0.250) = 0.6937. 

If we look more carefully at Fig. 26 we shall see that the sum 
of the areas of these trapezoids gives a value somewhat greater 
than that of the area of the curvilinear trapezoid. This means 
that we have found an approximate value for In 2 exceeding 
the real value (a major approximation), i. e. that In 2 is somewhat 
less than 0.6937. 

Later we shall acquaint ourselves with another method of cal- 
culating logarithms which will make it possible to obtain, in 
particular, In 2 with a higher degree of accuracy. 
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14. is the abscissas are laid off not from the point O but 
'Yost 'he point C (Fig. 27) and the new abscissas are denoted 
:;, \K-. letter r, then the connection between the new and the old 
; ;si:;s,as of one and the same point will be 

x ■- 1 + t. 

This -.: onnection will bi, true for any point if we assume that t >. 

wher; - > 1, and t ■$ when < < 1, On substituting 1 + t for x the 
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■v>!ct!o:-i y - — becomes y = - , but the graph remains the 

same. Anything new resulting from the introduction of t refers only 
:c tie origin (C instead of 0), and, hence, to the new axis Cy 
^parallel to Oy); the curve itself remains unaltered. The area 
ACDB does not change either. But when we took x as the abscissa 
this area was expressed by the integral 



J x-'dx = ln(l + p) 



(here <3 = CD). 

Now when we take t for the abscissa, the same area is expressed 

p 
y -b.fi integral J(l + t)~ ' dt. Comparing the two integrals we obtair. 

ln(l +p) = J(l + t)" 1 ^- 

o 



Let us now note the following identity: 

l-f + f 2 -t 3 + ...-f 2 "- 1 - '' -- 



1+t 



We can see it immediately if we note that the left-hand side 
contains a geometric progression with 1 as its first term, - 1 as 
the common ratio and - t 2n ~ l as its last term. This identity 
implies 

1 t 2n 

= 1 -t + t 2 -t 3 + ...-t 2 "' 1 + 



1 + 1 1 + 1 ' 

Therefore 

l+t 



ln(l + p) = f \l - t + t 2 - t 3 + ... - f 2 "" 1 + ''" 



dt. 



Now a more complex and cumbersome expression containing the 
sum of several summands has appeared under the integral sign 
instead of (1 + f) '• W e have learned by now that the integral 
of a sum or a difference of functions is equal to the sum or the 
difference of the integrals of these functions. Hence, 

ln(l + p) = f \dt - ]tdt + \t 2 dt-\t 3 dt + ... 



p 

...-\t 2 "- l dt+ -i 

o J 1 + ' 



dt. 



We are now able to compute each of the integrals on the right- 
hand side except for the last integral. So we can write 

P P R2 p o3 

Jldt-P, \tdt=\, \t 2 dt=±-, 

O Z J 

p R* P ft 2 " 

^ = JL... )K .-., t = JL ] _. 

H o •"' 



It follows that 



h(1 + B _(,-£ + £._£. + ...-£) + r_£ 



f 2n 

l + f 
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The expression in the parentheses on the right-hand side of the 
equation is a polynomial of degree 2/i arranged according to the 
increasing powers of p. If the value of P is known, and if, 
in addition, n is taken to be a positive integer (it can be 
arbitrary), then the value of this polynomial can easily be computed. 
Difficulties arise only when we begin to compute the integral 

dt. We shall prove that by making n sufficiently large we 



l + t 



can make the integral arbitrarily small for — 1 < P < 1. If this 
is so, then in computing ln(l + P) we can neglect the last integral; 
this will result in no more than a negligible error. Consequently 
we obtain the following approximate equality: 

B 2 B 3 B 4 B" 2 " 



15. To estimate the error of this approximate equality we must 

consider the deleted integral 



dt. Assume, first, that 

1 +t 







< P < 1. Then, within the limits of integration, t remains positive 
and, consequently, 

< -r — < t 2 ". 
1 + t 

t 2 " 

This means that the graph of the function y = lies below 

6 v / 1 + 1 

that of the function y = t ln (Fig. 28); therefore, the area CBA t 

is less than the area CBA, i. e. 



• 2n p R 2n+1 

- — dt<\t 2n dt= /— ,- 
+ 1 i In + 1 



,2n 
f 



Thus, the error of the approximate equation derived above is 

g2n+l 

less than — — — ; since < p < 1, this error can be made 

2n + 1 

arbitrarily small when n is sufficiently large. Take, for instance, 
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P = 1 ; in this case the formula derived above will yield 



ln2*l-y + y 



with an error less than 



1 1 

l 



l 



• + ...+ 



l 



6 In - 1 In ' 

. . If we use this method to calculate 
in + 1 

In 2 with an accuracy to within 0.001, then we must assume 

— to be less than 0.001, i.e. In + 1 > 1000; this condition 

2n + 1 




Fig. 28 



can be satisfied by putting In = 1000. But in this case the left-hand 
side of the equation will contain 1000 terms whose sum must be 
computed. It goes without saying that this is a difficult job. 
We shall soon learn how we can do it with the aid of another 
formula for In 2. 



f t 2 " 
16. Let us again consider the integral dt, 

o 
putting - 1 < p < 0. We know that 



but now 



p o 

C t 2n dt _ f t Zn dt 

J 1 + t ~ J 1 + t 
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The integral 



c t 2n dt . 

ts 

J l + t 



equal to the area of the figure ABCK 



shown hatched in Fig. 29. This figure is located above the line Ct, 

t 2 " 
since y - > for f > — 1 . Consequently the area of ABCK 

o 



is a positive number, i. e. the integral 



f t 2n dt 
J ! + ' 



is a positive 




Fig. 29 



f t 2 "dt 
J 1 + ' 



number. It differs from the integral | * ,"* only in sign and is 

o 
therefore equal to the absolute value of the latter: 



(• t 2n dt C t 2n dt 



p 
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Further, let us note that for t > P and > - 1 the inequaiit 
1 + t > 1 + ? > 

holds, and hence, 

1 1 

< "7 — -: 



1 + f 1 + 

and 



r 
< 



I + / 1 4 P 

t 2 " 

This means that the graph of the function v --- — lies b*'!c" 

iff 

t 2 " 
the graph of the function y — - — j, in the interval p -: 

(Fig. 29). Therefore the area of the figure ABCK is less tbar ihut 
of ABCL: 





(• t 2 "dt [j 2 "dt 

J i + r ""J r+p 

p p 



It is easy to calculate the right-hand side of the inequality: 
o 

I — - — t 2n dt = — - — \t u dt = 
J 1 + p 1 + P J 

P 

I 2 " +l — ft 2 " ' ' B 2,l+ ' 

= T~-Hp" 2n"+l = ~ l2nT T)(TT P) 

(this is a positive number since p 2 " M <0, 1 + P>0 ant! 
In + 1 > 0). It follows that 

r t 2n _ c t 2n dt __ _ _p^'_ _ 

J 1+f f ~ J 1 +t" < ' (2« + lj(l "'+ p) ' 
o p 

P 

f t 2n dt 
And so by deleting the term in the expression for 

o 
ln(l + p) we make an error less in absolute value than 
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p2n + l 

(-1<P<0). It tends to zero for indefinitely 



(2« + 0(1 + P) 
increasing n. 
Thus the approximate formula 

B 2 B 3 B 4 B 2n 

is valid with an accuracy to within the factor 
p2„ + i 



(2n+l)(l+P) 



for - 1 < p < 0. 



Let us make, for instance, P = — — ; in this case the error of 
the approximation will be less than 



1 



y (2n + 1) 



1 



(2n+l)2 2 



If we take n = 4, the last fraction will be equal to 

-r-^rg- = q—^tt = ~^^kt~ < 0.0005. This means that with this degree 
of accuracy we can write 

_1_ J 1_ 1 _1 1_ 

2* 2 2 2 -2~ 2 3 -3 2*4 2 5 -5 

1 1 1 



2 6 -6 2 7 -7 2 8 -8 
The requisite calculations yield 

y = 0.5000; y/y = 0-1250; y^y = 0.0417; y^ = 0.0156 



y^y- = 0.0062; y^- = 0.0026; y^y = 0.001 1 ; y/y = 0.0005 

and we obtain 

In 4- * -0.6927% -0.693 

2 

with an accuracy to within 0.001 (we take into account that the 
formula itself could involve an error of up to 0.0005, and, in 
addition, an error up to 0.00005 could arise when we reduce 
each of the eight summands to a decimal fraction). 
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Since In— = — In 2, it follows that 

In 2 as 0.693. 

2 
If we assume B = - -r- in the approximate formula for 

ln(l + P), we can use the same method to calculate 
In— and, consequently, In 3= —In— as well. In general, if we 

assume B = — ; — we obtain In [ 1 — ; — ) = In — and, 

k+1 \ k+ I J k+ 1 

consequently, In (k + 1) =— In -r — p. However, this method of 

K ~t~ 1 

computing logarithms is still very cumbersome. For instance, if we 
intend to calculate In 11, then taking, k + 1 = 1 1, i.e. k = 10, 

we should have B = — , in which case the error in the 

approximate formula will be less than 

10Y" +1 ^ ,J, 10 \ 11/10 
-n- :<2n+ lWl- 



11 I v ; V 11 / 2n+ 1 \ 11 



We have: 

i° 

1 



^o,, ; (-y,o.s3 ; (-;^ ; (-;,„. 48; 



So we see that only by taking In + 1 = 65 can we guarantee 
that the error in the approximate formula for computing 

In-— will be less than — ^- 0.005 « 0.001. 
11 65 

It is evident that the computation of In — - in this case will be 
tedious, since we have to calculate the sum of 64 terms: 

JO 1 /jpY_ J_/J0_Y_ _L(}®Y 

11 2\ llj 3 \ll ) "' 64 ^ ll7 ' 
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17. The conclusions we have come to concerning the approximate 
formula for ln(l + P) force us to look for another formula 
requiring fewer numerical calculations. Such a. formula does exist, 
and to obtain it we take an arbitrary positive integer k and put 

B = -=-. — . We then have 

K 2k + 1 

/ 1 \ _J 1 1 

\ 2k + 1 ) * 2k + 1 2(2k + l) 2 + 3(2fcT17 

1 1 1 



4(2* + l) 4 (2« - l)(2/c + l) 2 "- 1 2n(2* + l) 2 " ' 

The error in this approximate equality is less than 

+y-. Now let us take a negative p equal 



(2n + 1)(2* + 1) 

to - -T7 --. This time we obtain another approximate equality 

, /. •i \ L __J L_ 

\ 2k+l)~ 2k + 1 2(2*+ I) 2 3(2* +1) 3 
1 1 1 



4(2* + l) 4 '" (2n - l)(2fc + l) 2 "" * 2n(2* + l) 2 
yielding In I 1 — rr — — I with an error less than 

_ (2 - + 1 >( i -2*tt)]- 



1 



(2k + 1) 2 " +1 ' 



2k + 1 1 



2* 2n+ 1 (2*+ l) 2n+1 ' 

Subtracting the second approximate equality term by term from the 
first we obtain 



2k + 1 ) \ 2k + 1 

2 2 2 



2k + 1 ' 3(2* + l) 3 + 5(2* + l) 5 + '" 

2 



"■■ (2n- 1)(2* + l) 2 
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The error in this approximate equality does not exceed, in 
absolute value, the sum of the errors of the formulas for 

lnl 1 + jt — p I and In 1 1 — I, and so it is less than 

1 1 2k + 1 1 1 



In +1 (2k + 1) 2 " + r 2k 2n + 1 (2k + l) 2n+ ' ' 

_ 4k+ 1 1_ 1 

2k ' 2n+l ' (2k+l) 2n+1 < 

4k + 2 1 1 1 



2k 2/1+1 (2* + l) 2n+1 k(2n + \)(2k + l) 2 " ' 

We transform the difference between the logarithms, noting that 
it must be equal to the logarithm of the quotient. We receive 



1+ l 



ln(1+ 1 Wl-^=ln * +1 



2k + 1 J \ 2k + 1 / i 



2k + 1 



, 2k + 2 , k + 1 ... ,. , . 
= ln — — — =ln — t — =ln(/c + 1) -ln/c. 

And so we have 

ln(A + 1) - ln/c * -^-j- + , m ; 3 + .... + ... 



2fc+ 1 3(2/c + l) 3 5(2^+ l) 5 



'• + (2n-l)(2*+l) 2 "- 1 W 



with an error less than 



k 2n+l (2k + l) 2n " 

This is precisely the formula we need. It allows ln(/c+l) 
to be calculated when lnfc is known. Making use of the fact that 
lnl =6 and assuming k = 1, we can find In 2 with an error less 
than 

_1 1_ 

2n + 1 ~W' 

Let us take n = 5; in this case we can be sure that the error 
will be less than 1/1 1 • 1/3 10 = 1/(1 1 • 59049) < 0.000002. 
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This yields 

2 2 2 2 2 

ln2 = ln2-lnl* T + -- F+1 -3 r + TT7 -+ ? ^ 

with an error less than 0.000002. Converting each of the five 
fractions into a decimal to six places (i. e. accurate to within 
0.0000005) and adding we obtain the value of In 2 with an 
accuracy to within 0.000002 + 0.0000005 • 5 < 0.000005: 

ln2« 0.693146 x 0.69315. 
Now putting k = 2 and n = 3 in formula (*) we receive 

ta3-ln2*2+ 3^+^0.40546 

with an error less than 1/2- 1/7- 1/5 6 = ,. ,^. g < 0.000005. 

14- 15625 

Therefore 

In 3 v In 2 + 0.40546 % 1.09861. 

Next we obtain In 4 = 2 In 2 x 1.38630; assuming k = 4 and n = 3 
in formula (*) we receive 

ln5 -ln4*! +I l3- + ^*0.223144*0.22314 



with an error less .than 

111 1 



< 0.0000001 



4 7 9 6 28-531441 

and consequently 

In 5* In 4 + 0.223 14* 1.60944. 

Now we can easily find In 10: 

In 10 = In 5 + In 2* 2.30259. 
And, finally, putting k = 10 and n = 2 in formula (*) we obtain 

In 1 1 - In 10 « ~ + - ;?,, * 0.09531 
21 3-2P 

(here the error in the approximate formula is less than 

1/10- 1/5- 1/21 4 * 0.0000001). 

Therefore 

In 1 1 « In 10 + 0.09531 * 2.39790. 
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These examples are sufficient to understand how the table 
of natural logarithms is constructed. This is the way to construct 
the following table of logarithms of integers from 1 to 100, calculated 
with an accuracy to within the factor 0.0005. 







Table 


of Natural Logarithms (from 


1 to 100) 






n 


Inn 


n 


ln« 


n 


Inn 


n 


\nn 


H 


Inn 


i 


0.000 


21 


3.045 


41 


3.714 


61 


4.111 


81 


4.394 


2 


0.693 


22 


3.091 


42 


3.738 


62 


4.127 


82 


4.407 


3 


1.099 


23 


3.135 


43 


3.761 


63 


4.143 


83 


4.419 


4 


1.386 


24 


3.178 


44 


3.784 


64 


4.159 


84 


4.431 


5 


1.609 


25 


3.219 


45 


3.807 


65 


4.174 


85 


4.443 


6 


1.792 


26. 


3.258 


46 


3.829 


66 


4.190 


86 


4.454 


7 


1.946 


.27 


3.296 


47 


3.850 


67 


4.205 


87 


4.466 


8 


2.079 


28 


3.332 


48 


3.871 


68 


4.220 


88 


4.477 


9 


2.197 


29 


3.367 


49 


3.892 


69 


4.234 


89 


4.489 


10 


2.303 


30 


3.401 


50 


3.912 


70 


4.248 


90 


4.500 


11 


2.398 


3! 


3.434 


51 


3.932 


71 


4.263 


91 


4.511 


12 


2.485 


32 


3.466 


52 


3.951 


72 


4.277 


92 


4.522 


13 


2.565 


33 


3.497 


53 


3.970 


73 


4.290 


93 


4.533 


14 


2.639 


34 


3.526 


54 


3.989 


74 


4.304 


94 


4.543 


15 


2.708 


35 


3.555 


55 


4.007 


75 


4.317 


95 


4.554 


16 


2.773 


36 


3.584 


56 


4.025 


76 


4.331 


96 


4.564 


17 


2.833 


37 


3.611 


57 


4.043 


77 


4.344 


97 


4.575 


18 


2.890 


38 


3.638 


58 


4.060 


78 


4.357 


98 


4.585 


19 


2.944 


39 


3.664 


59 


4.078 


79 


4.369 


99 


4.595 


20 


2.996 


40 


3.689 


60 


4.094 


80 


4.382 


100 


4.605 



18. We have seen that the logarithm of a product can be 
found by means of addition, the logarithm of a quotient by means of 
subtraction, the logarithm of a power by multiplication (by the 
exponent), and the logarithm of a root by division (by the 
index of the root). Therefore, if we have a table in which 
logarithms are written next to the numbers (a table of logarithms), 
then with the aid of that table we can replace multiplication 
by addition, division by subtraction, raising to a power by 
multiplication and obtaining a root by division, that is to say, 
every time a more complex operation is replaced by a simpler 
one You can learn how to do it from a high school algebra 
book so here we shall only consider a simple example. 
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Say we want to calculate c = yl. Making use of the value 
In 2 v 0.693 computed above let us divide it by 5 to obtain 

lnj/2 = 1/5 In 2 « 0.139. It remains to find the number ]/l by 
its logarithm. Our table is not very convenient, for it contains 
logarithms 0.000 (corresponding to the number 1) and 0.693 
(corresponding to the number 2), the first being too small and 
the second too large. From this we can only conclude that 

l<|/2<2. But it can be noted that ln(10|/2) = In 10 + 

+ In |/2 = 2.303 + 0.139 = 2.442. In our table the nearest smaller 
logarithm is 2.398 (=lnll), and the nearest larger logarithm is 

2.485 ( = In 12), consequently 11 < \Q]f2 < 12. Noting that 2.442 lies 
approximately between In 1 1 and In 12 (the arithmetic mean of 

the last numbers is 2.441) we can put 101/2% 11.5, i.e. j/2 « 
x 1.15. To check the result obtained take note of the fact that 

In (100 j/2) = In 100 + In ]/l = 4.605 + 0.139 = 4.744 
and 

In 115 = ln5 + In 23 = 1.609 + 3.135 = 4.744. 

•9. To construct the graph of the function y = lnx, it is 
necessary to choose coordinate axes and a scale unit and, then, 
for every x (x > 0) mark off the value of In x on the line 
perpendicular to the x-axis and raised from the respective 
point on that axis. The end points of the perpendiculars obtained 
for various values of x will be located on a curve constituting 
the graph of the natural logarithm. The graph of the logarithm is shown 
in Fig. 30a. Fig. 30b, located below, depicts lnx as an area, so that 
the two cases can be compared. Both figures are drawn to the same 
scale. If we take one and the same value of x, we can show 
that the number of unit squares contained in the area of the 
curvilinear trapezoid ACDB in Fig. 306 is equal to the. number 
of units of length contained in line segment XL shown in Fig. 30a. 

Let us note that if < x' < 1, then 



In 



'-J*--/* 



that is, lnx' is a negative number whose absolute value is equal 
to the area of the trapezoid B'D'CA ; therefore, for this case In x' 
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will be shown in Fig. 30a by a line segment K'L' marked off 
downwards from the x-axis. 

All the properties of the graph of the function _y = lnx follow 
from the definition and the properties of the natural logarithm. 
For example, lnx is negative for x < 1, vanishes for x= 1 and 
is positive for x > 1. Consequently, the graph of the logarithm 
is located below Ox for x < 1, intersects Ox for x = 1 and is 
above Ox for x > 1. Further, y = lnx increases with increasing x. 




This property is obvious when x > 1 (see Fig. 30b), but it is also 
true for x = x' < 1. Indeed, if x' increases, remaining less than unity, 
the absolute value of the area B'D'CA (Fig. 306) decreases, and 
this means that lnx, which differs from that area only in sign, 
increases. 

This increase in the logarithm is seen in the graph in the 
form of a curve rising as if uphill from left to right. This 
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hill, steep at first, becomes flatter and flatter. For greater clarity, 
we shall call the graph of the logarithm depicted here a logarithmic 
slope. 

If we cut a horizontal path along the v-axis and proceed along 
this path from the point to the right, then, looking down, 
we first see an infinite gulf with the logarithmic slope lost in its 
depth. However, we need only to take a step equal to the unit oi 
length to leave the gulf behind. Continuing our walk along the 
path, with each step we find ourselves higher and higher on the slope. 
Thus, after two steps (x = 2) our height will be In 2 = 0.693, after 
three steps In 3 = 1.099, etc. Let us calculate how great the 
increase in the height of the slope will be when after m steps 
we take one more step. Since after m steps (a unit of icngth each) 
the height of the slope will be equal to In m, and after m + 1 
steps it will be ln(m+l), the increase in the height of the 
slope corresponding to one step is 

In (m + 1) - In m = In = In ( 1 + — 

m \ m 

The greater the number of steps we take the less will be the 

number — , the more 1 H will approach unity, and the closer 

mm 

In [ 1 -) 1 will be to zero. This means that the steepness of the 

V m J 

slope becomes less and less noticeable as we move to the right, 
i.e. the logarithmic slope becomes indeed less steep. 

The comparative flatness of the slope does not prevent it from 
rising indefinitely, so that when we go sufficiently far along 
the horizontal path the slope will rise above us indefinitely. 

Indeed, after we make 2 m steps the height of the slope will be 
equal to 

In2 m = mln2= 0.693 m, 

and for a sufficiently large m this number will be arbitrarily 
large. 

If instead of a horizontal path we cut some other straight 
path with a rise, however slight (Fig. 31a), and travel along it, 
then, sooner or later we shall reach the logarithmic slope, 
and shall even leave it down below us when we rise further 
upwards (Fig. 31fr). 

To be sure of this let us prove the following lemma: for any 
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natural m there holds the inequality 

Am 



4m 



Indeed, when m increases by 1 the fraction — 5- increases, i. e. 

m 
4m 41H+1 

< 



m 2 (m+ l) 2 




Fig. 31 



this follows from the inequality 
4«i+ 1 4 



(m + l) 2 ' m 5 



4m 2 _ / 1m V _ / ro + m V > 
" (ro+1) 2 "Vm+lJ "Vm+i; 



which holds for m > 1. This is why from among the fractions 
41 42 4m 

-p-> -jr. ••••-^r. ••• 

the first always has the lowest value, i. e. 

41 4"! 

this is what we wanted to prove. 

Now note that for each point on the sloping straight path 
there holds the relation 



y = x tan a, 



VH-oi \r,-> 
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where a is the angle of inclination of the path (a is an 
acute angle and, consequently, tana>0). If we take x = 4 m , the 
altitude of the slope of the path for this value of x will be 
4 m tana, and the height of the logarithmic slope will equal 
ln(4 m ) = mln4. The ratio of the first height to the second 
will be 



4 m tan a 
mln4 



4 m tana 
m 2 ln4 



m. 



4>n 

But according to what was proved above, — r > 4, and therefore 

m 

the ratio between the height of the path and the height of the 

logarithmic slope is not less than — m, and for a sufficiently 

large m this value can be made arbitrarily great. Consequently, 
for x = 4 m and a high value of m the rising straight path will be 
considerably higher than the logarithmic slope (see Fig. 31b). 
It is remarkable that the logarithmic slope has a rounded shape, 
without any irregularities, and is convex throughout. This 
property can be expressed in geometric terms: every arc of the graph 
of the logarithm lies above the chord of that arc (Fig. 32). 
Denoting the abscissas of the end points of an arbitrary arc 




Fig. 32 



LiL 2 by x t x 2 we can make sure that for the mean value of 

x = l — — a point on the arc L must indeed lie above the 
corresponding midpoint of the chord M. 
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In fact, 

*1 + *2 



NL= In 



2 
and 



(as the middle line of the trapezoid), i. e. 

lnxi + lnx 2 



NM = 



2 



We have to prove that 



, x. + x 2 lnxi + lnx 2 
In— — — — > 



2 2 

But we have 

lnx t + lnx 2 1. , . . ./ 

l -j = yln(x 1 x 2 ) = ln|/x 1 x 2 . 

Therefore it is necessary to prove that 

In — 5__ — L > ln|/x 1 x 2 . 

Let us note that 

(\/x[ - \/xi) 2 = x t - 2 ]/x^x 1 + x 2 > 

(in case Xi and x 2 are positive numbers not equal to each other). 
Therefore 

x, +x 2 > 2 ]/x,x 2 , 

next 

Xt +x 2 



^ > j/x^x 
and finally 



Xi + X 2 



In l 2 > In ]/x t x 2 . 

The proof is complete. 

Thus, whatever the arc of the graph of the logarithm, the point 
of the arc corresponding to the arithmetic mean of the abscissas 
of its end points always lies above the midpoint of the chord. 
This implies that the graph of a logarithm can have no troughs. 
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Indeed, if there were such a trough (Fig. 33), an arc would be 
possible for which the above property would not hold (the 
midpoint of the chord M would lie not below, but above the 
corresponding point of the arc L). 

Proceeding from the properties of logarithms, we could derive 
some other interesting properties of the graph of a logarithm, but we 
shall restrict ourselves to those just considered. 




Fig. 33 



20. We may encounter natural logarithms while solving many 
problems in mathematics and physics which at first glance appear 
to have nothing to do with the areas of curvilinear trapezoids 
bounded by the arcs of a hyperbola. Here is one of the problems 
of this kind studied by the eminent Russian mathematician 
P. L. Chebyshev: he wanted to find the simplest formula possible 
for an approximate calculation of all prime numbers not exceeding 
some given (arbitrary) number n. 

If n is not large, then the question as to the quantity of prime 
numbers denoted by n (n) (here n has nothing in common with the 
familiar number 3.14159...) is decided very simply. Thus, if 
n = 10, the prime numbers not exceeding 10 are the following: 
2, 3, 5, -7; there are four of them and consequently n(10) = 4. 
If n = 100, then we make use of the familiar method known as 
Eratosthenes' sieve and obtain 25 prime numbers: 2, 3, 5, 7, 11, 13, 
17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 
83, 89, 97; consequently n(100) = 25. However for a large n the 
problem becomes rather difficult. How can we calculate n(n), even 
very approximately, when n is equal to a million, a thousand million, 
and so on? 

Chebyshev found that to calculate n(n) approximately it is 
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sufficient to divide n by the natural logarithm of n: 

7i(n)% ; 

In /i 

the relative error of this equality (the error measured in fractions 
of the number it (n)) may be very large, but it tends to zero when n 
tends to infinity. Chebyshev's approximation formula becomes very 
convenient in the case of n equal to a power of 10 with a positive 
integral exponent: n = 10*. Then we obtain Inn = In 10* = /cln 10 * 
« 2.303 k and, hence, 

*( 10 ^13W- 
Making use of the fact that ^r^rr * 0.434, we can obtain a 

formula still more convenient for computations: 

10* 
jr( 10*)* 0.434-^-. 

Thus, for k = \ and k = 2 we find: 

it (10)% 0.434- 10 = 4.34 (the correct result is 4), 

100 
7t( 100)% 0.434— — ss 21.7 (the correct result is 25). 

If we continued the calculations we would obtain the following: 

1000 
71(1000)35 0.434-— —3= 145 (the correct result is 168), 

10000 
Jt(10000)% 0.434 ]p-« 1090 (the correct result is 1229), 

10 6 
jc(10 6 ) =s 0.434 •——% 72300 (the correct result is 78498). 
6 



The relative error of the last result is 

78498-72 300 



78498 



0.08, 



i. e. 8 per cent, it is still considerable. However, a very rigorous 
proof can be presented that the relative error of Chebyshev's 
formula can be made infinitely small if 10* is sufficiently large. 
At some stage it will be smaller than one per cent. Further it 
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will become less, than 0.1 per cent; next, less than 0.001 per cent, 
etc. This explains why Chebyshev's formula is. of great theoretical 
value. 

To Chebyshev's credit is another formula for an approximate 
calculation of n(n), somewhat more difficult but allowing a better 
approximation: 



■«-te 



Without doing the calculations we shall give here only some 
results: 

1000 

M 177 (it(1000) = 168); 



lnt 



2 
10 000 



1 



dt se1245 (it(10000) = 1229), 



lnt 

2 
000 000 

Y~ « 78 627 (it (1 000 000) = 78 498). 

2 

Hence, the relative error of the approximate equality 

1 ooo ooo 

n(l 000000)* f -^ 

2 

is equal to 

|78498-78627| 



78498 



: 0.0016, 



that is 0.16 per cent. 
21. We saw that 



ln2 = 0.69315 < 1, and In 3 = 1.09861 > 1. 
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This means that the area of ACDB (Fig. 34) is less than 1, and the 
area of ACD l B 1 is greater than 1. It can be expected that 
somewhere between the points D and D y there will be found a 
point D' such that the area of ACD'B' will be equal to 1. 
Such a point D' does exist. If we denote OD' by e we may state 
that 2 < e < 3. Making use of the table of logarithms given on 




Fig. 34 

p. 45 we find that 2.7 < e < 2.8. Indeed, 

In 2.7 = In 27 -In 10 w 0.993 
and 

In 2.8 = In 28 -In 10= 1.029. 

There exist various methods of finding e with any degree of accuracy. 
Without considering them in detail, we shall only give the result: 

ex 2.71828 

(all the digits written here are correct). By definition 

lne= 1. 

The number e is termed the base of natural or Napierian 
logarithms, named after John Napier, a Scottish mathematician 
who published the first table of logarithms (in 1614). 

Proceeding from the properties of natural logarithms we can 
prove the following remarkable proposition: the natural logarithm 
of any positive number b is equal to the exponent to which the 
number e should be raised in order to obtain b. In other words, 
if In b = a, then b = e". For example, from the fact that In 2 % 0.69315 
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it follows that 2xe 069ils \ from In 10 % 2.30259 it follows that 
10 * e 2 - 30259 , etc. 

To prove this it is sufficient to use the property of the logarithm 
of a power. Let b = e x ; then Inb = lne* = xlne. But lne=l, 
therefore 

In b = x, 
i. e. the natural logarithm of b coincides with the exponent x. 

Thus we see that natural logarithms can be determined 
without resort to geometric representations. It could be said from 
the very beginning that the natural logarithm of the number b 
is an exponent of the power to which the number e x 2i71828 
must be raised to obtain the number b. But this definition does 
not show clearly enough why we are interested in the 
exponents of the power of the number e and not of some other 
number. Now if natural logarithms are introduced as areas, their 
definition becomes pictorial enough, and does not leave any doubt. 

We should say, at this point, that besides natural logarithms, 
some other logarithms can be introduced, with another base. 
Thus, for instance, a common logarithm of the number b is the 
exponent of the power to which the number 10 should be raised to 
obtain the number b. The common logarithm of the number b 
is denoted log b. If we have log b = p\ then, by definition, we must have 
b = 10 e ; it is evident that log 10 = 1. Common logarithms are 
studied in high school, and there all their properties are 
derived not by geometrical means, but on the basis of the known 
properties of power exponents. 

There is a simple relationship between common and natural 
logarithms. Let In b = a and log b = p. This means that b = e* 
and fc = 10 p , i.e. e' = 10 p . Consequently, In e" = In 10" or 
aln <? = pin 10, i. e. a = p • 2.30259. Thus we have Inb = 2.30259 logb, 
whence 

\ogb= 23Q 1 259 lnb = 0.43429 In b. 

Having before us the table of natural logarithms and multiplying 
each logarithm by 0.43429 we obtain the table of common 
logarithms. 
For example, 

log 2 = 0.43429 In 2 = 0.43429-0.69315 x 0.30103. 

For log 10 we must obtain unity: 

log 10 = 0.43429 In 10 x 0.43429 • 2.30259 = 1. 
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The fact that the number 10 is taken as the base of common 
logarithms (the number 10 is the base of the decimal system of 
notation) considerably simplifies logarithmic computations. Thus, 
knowing that log 2 = 0.30103 and log 10 = 1, we immediately obtain 

log 20 = log 2 + log 10 = log 2 + 1 = 1.30103, 

log 200 = log 2 + log 100 = log 2 + 2 = 2.30103, 

and so on. 

Now, if we know that In 2 = 0.69315 and In 10 = 2.302585 and 
want to calculate In 20 and In 200, we should proceed as follows: 

In 20 = In 2 + In 10 = 0.69315 + 2.30259 = 2.99574, 

In 200 = ln2 + In 100 = ln2 + 21n 10 = 

= 0.69315 + 4.60517 = 5.29832. 

This explains why when using logarithms as an auxiliary means 
in computations we prefer to employ the tables of common 
logarithms. But this in no way belittles the importance of natural 
logarithms, which are encountered in the solution of many problems 
in mathematics and the natural sciences. In this book we 
discussed two mathematical problems leading to natural logarithms, 
that is the problem concerning the area of an equilateral hyperbola 
and Chebyshev's problem on the distribution of prime numbers. 
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SUPPLEMENT 



1. To calculate the area of a curvilinear trapezoid we replaced 
it either by the sum of the areas of rectangles (p. 8), or by the sum of 
the areas of rectilinear trapezoids (p. 33). Using the latter method we 
got for In 2 the approximate value 0.6937. On p. 44 we employed 
another method to compute In 2 and obtained the value 0.69315, 
which contains an error less than 0.000005. Hence we see that the 
calculation with the aid of trapezoids led to an error greater than 
0.0005, notwithstanding the fact that we divided the line segment 
CD in Fig. 26a into many parts (10) so that the altitudes 
of the rectilinear trapezoid represented by these segments constituted 
only 0.1 each. 

There are other methods of approximation suitable for any 
curvilinear trapezoid and leading to highly accurate results while 
requiring computations no more difficult than with the methods 
just described. We shall now consider a method named after an 
eminent English mathematician Thomas Simpson (1710-1761), 
although a method like it was suggested 75 years earlier by his 
countryman James Gregory (1638-1675). The main idea is to replace 
the arcs of the graph of the function, not by chords, as was done in 
the case of rectilinear trapezoids (see Fig. 26b), but by the arcs of 
parabolas. 

In high-school algebra books the term parabola is used to describe 
the graph of the function y = ax 2 . Here we shall use this term 
in a wider sense, describing as a parabola the graph of any 
function of the form y = ax 2 + bx + c. For a # and b = c = it 
is a parabola in its ordinary position, with a vertex at the origin. 
If a^0, as before, but one of the coefficients, b or c, is also 
different from zero, then, as can be verified, the graph of the function 
is the same parabola but with its vertex transferred to some 
point different from the origin (Fig. 35). Finally, if a = 0, we have a 
straight line y = bx + c. Taking this into account, we shall continue 
to call it here a parabola, interpreting it as a special case. 

Let us prove now that through three points of a plane, A (x , y ), 
B(xt, yi), C(x 2 , y 2 ), with pairwise distinct abscissas x , x, and x 2 
we can draw one and only one parabola. This means that there 
exist coefficients a, b and c such that the graph of the function 
y = ax 2 + bx + c will pass through each of the indicated points. 
In this case the values of the coefficients are determined 
uniquely by the choice of the points A, B and C. 

To prove this assertion we take a, b and c as unknowns. 
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The coefficients a, b and c we have to find must satisfy the 
following three conditions: the graph of the function y — ax 2 + bx + c 
must pass (1) through the point A(x , y ), (2) through the point 
B(x it yi) and (3) through the point C(x 2 , y 2 ). In other words, 
the value of the required function is y for x — x , y t for 
x = x, and y = y 2 for x = x 2 . Therefore we obtain three equations 



y=x 2 -4x+5 




Fig. 35 

with three unknowns a, b and c: 

y = axl + bx + c, ( ! , 

y\ = ax 2 + bxi + c, (2) 

y 2 = ax\ + bx 2 + c. (3) 

If we subtract termwise the first equation from the second and the 
second from the third, we shall have a system of two equations with 
the unknowns a and b: 

a(x\ - xl) + fe(x, - x ), (1) 



yi-yo 

y 2 -y x = a(x 2 2 - x\) + b(x 2 - x,). 



(2') 



Here it is convenient to divide all the terms of the first equation by 
X! - x , and of the second by x 2 - x,. By the hypothesis thest 
numbers are known and are different from zero. So we obtain 



yi 


-yo 


*i 


- x 


^2 


-yi 



a(x t + x ) + b, 



- =a(x 2 + Xj) + b. 



(1") 

(2") 

x 2 -x, 

Now we subtract the first equation of the last system from the 
second equation and obtain a single equation defining a: 

yi -y\ y\- yo 



X 2 — Xi Xj — Xq 



= a(x 2 - x ). 
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By dividing all the terms of the last equation by the number 
x 2 - x (not equal to zero) we find the only possible value of the 
required coefficient a Let us substitute it, say, into equation (1"). 
Then from this equation we immediately find the only possible value 
of coefficient b. Finally, substituting the values of a and b thus 
found into equation (1) we find the only possible value of the last 
unknown coefficient c. We believe the presentation of the calculations 
here to be unnecessary. The expressions for a, b and c will 
evidently be rational numbers, specified by the coordinates of the 
three given points (verify this, beginning with a). 

Thus it follows from our discussion that the coefficients a, b and c 
cannot possess any other values except those given above. Hence 
there exists only one parabola passing through the three given 
points. It is easy to verify that the obtained values of a, b 
and c satisfy equations (1), (2) and (3). 

2. Now take three points A(x , y ), B(*i, yi) and C(x 2 , y 2 ) 
with pairwise distinct abscissas such that x < X! < x 2 and Xi is 
located exactly in the middle of the segment [x , x 2 ]. This 



means that Xj = 



x + x 2 



According to what was proved above 



one and only one parabola, y = ox 2 + bx + c, passes through 
them (Fig. 36). Let us consider the area S of the curvilinear 
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Fig. 36 



trapezoid ADEC. We shall prove that it is equal to 

—^—? — —{yo + 4yi + y 2 ). In other words, for the parabola passing 
o 

through the given points the following formula is valid: 



S = -53L 



*o 



(yo + 4y, + y 2 ). 



(4) 
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The case is not excluded when all the three points lie on a single 
straight line. If this line is parallel to the x-axis (Fig. 37a), 
then y 2 = y\ = yo, and from formula (4) we obtain the expression 

for S: S= — i -~ — - ■6vq = (x 2 — x )y a . But this is precisely the 

area of the corresponding rectangle (according to the condition 
introduced on p. 7 it is expressed by a negative number if 
y < 0). Now if the straight line is not parallel to the x-axis 
(Fig. 31b), then S is equal to the area of the rectilinear 
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Fig. 37 



trapezoid taken with the requisite sign. Its midline is y v = 
_ yo yi an( j its a ] t j tu( j e j s e q Ua i to Xz _ Xq Substituting in 

formula (4) 2y, for the sum y + y 2 , we receive S = 

= 2 ~ ~ (2y 1 +4y l ) = (x 2 -x )y i , again a correct result. By 

6 
the way, to prove formula (4) there is ■ no need to specially 
consider each individual case. In all cases the proof is the same. 

Let us calculate the coefficients a, b and c by the technique 
considered in the previous section. Then y = ax 2 + bx + c is a function 
whose graph passes through the given points. This means that the 
three equalities (1), (2) and (3) are satisfied. Now we shall prove 
formula (4) without expressing the coefficients a, b and c in terms of 
the coordinates of the points A, B and C (in fact, we did not 
give these calculations in Section 1). We shall simply make sure 
that the formula is valid. The reader is asked to accept on trust 
that the reasoning of the proof is correct; otherwise some cumbersome 
computations would be necessary to find the expressions for a, b 
and c. 

First we express the area S, which we can term the area of a 
parabolic trapezoid, as an integral. Using the known properties of 
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X 2 X, X 

integrals and the formulas for j x 2 dx, )xdx and J dx, we obtain 

x x x 

Xi Xi X 2 X] 

S = j(ax 2 + bx + c)dx = a \x*dx + b \xdx + c \dx = 

JC<> X Xq X q 

= a ?L^L +b A^± +c{X2 - Xo) = 

2a (xl - Xq) + lb{x\ - Xq) + 6c (x 2 - x ) 



All the binomials placed in parentheses here have a common 
factor x 2 - x . This is obvious for the last binomial and, besides, 
xl - *o = ( x 2 ~ *o)(*2 + *o) and xl - xl = (x 2 - x )(xj + x 2 x + 
+ Xq). Consequently, moving the common factor outside the 
parentheses we express S in the form 

S = Xl ~ *° [a(2xi + 2x 2 x + 2x1) + b(3x 2 + 3x ) + 6c]. 
6 

Comparing the result with formula (4), which we are trying to 
prove, we see that it only remains to verify the equation 

a (2x 2 2 + 2x 2 x + 2x1) + b (3x 2 + 3x ) + 6c = y + fyi + v 2 . (5) 

Use the expressions (1), (2) and (3) for y u y 2 and y 3 . Since the 
left-hand side of equation (5) does not contain the abscissa X! 
of the point B, we replace x t by the expression 

—5— — — before substituting the expression for y x . We then have 
yi =a(*^) 2 + b(^-) + c, 



whence 

4.V! = a(x^ + 2x x 2 + xl) + b(2x + 2x 2 ) + 4c. 

Consequently 

>o + 4yi + y 2 - (axl + bx + c) + 

+ [a(xl + 2x x 2 + xl) + b(2x + 2x 2 ) + 4c] + (a 2 x\ + bx 2 + c) = 
= 0(2x1, + 2x x 2 + 2x 2 2 ) + b(ix + 3x 2 ) + 6c 
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(we have combined the terms containing the same coefficient 
a, b or c). Thus we see that formula (5) is valid and consequently 
formula (4) is valid too. 

Applying this formula we can immediately see, for instance, that 
in the case of x = 2, x t = 3, x 2 = 4, and y = 2, y t = 4, y 2 = 3, 
the area of the parabolic trapezoid ADEC (see Fig. 36) is equal to 

4-2 

— - — (2 + 4-4 + 3) = 7. This is the exact result. But a graph of 

6 
some other function which is not a parabola can pass through the 
same points A, B and C (in Fig. 36 it is shown by a dashed 
line). If we replace this dashed line by a solid line, an arc of a parabola, 
and then calculate the area of the corresponding curvilinear 
trapezoid, the previous result, 7, will no longer be exact, but 
approximate. Making use of this idea let us once more find an 
approximate value of In 2. We made an approximate calculation 

2 

of the integral — , and for this purpose make an approximate 



replacement of the arc of the hyperbola y = — by an arc of the 
parabola passing through the points A, B and C (Fig. 38), for 




Fig. 38 



which x = 1, x 2 = 2, x, = 
1 



x + x. 



= 1.5, and y = 



*o 



= 1, y 2 = — = -r- and >', = — = — . The parabola is not 
x? 2 Xi 3 



given in the figure since in this case it differs very little from 
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the arc of the hyperbola. Applying formula (4) we obtain 



In 2 = 



[ dx 1 A „ 2 1 
_*_ (l+.4 T + T 



= § = 0.694. 



The approximation we have obtained is rather close; a more exact 
value of In 2 is 0.69315 (see p. 44). 
3. To attain an error as small as possible in computing 

b 

\f{x)dx, the interval between a and b is divided into n equal 

a 

parts. Then the are of the graph of the function y =f(x) is 
also divided into n arcs. In accordance with what was said above we 
replace each of them by the arc of a parabola. Then we obtain an 
approximate expression for the integral as the sum of the areas of n 
parabolic trapezoids. Each of them can be found separately with the 
aid of formula (4). As a result we can find an approximate 
expression for the integral. 

All the above can be expressed as a formula named after Simpson. 

Let us designate, in turn, the abscissas of the points dividing 
the line segment between a and b into n equal parts by the letter x 
with even indices: x = a, x 2 , x 4 , ..., x 2 „- 2 , x 2n = b (in 
Fig. 39 n — 8). Now x with odd indices will designate the 




Fig. 39 



midpoints of the corresponding parts, i.e. x t = ° — — , x 3 = 
= -^~^,...,x 2 „^ = **-* + **■ . Each of the arcs of the 

graph /IBjCj, C 2 B 3 C^, ..., C 2 „- 2 B 2 „^ 1 C 2 „ is replaced by the arc of 
the parabola passing through three points: the end-points of the 
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arc and the point located above the middle of the corresponding 
interval of the x-axis. These are not in the drawing since they 
almost merge with the arcs of the graph under consideration. 
The areas of the parabolic trapezoids derived by formula (4) are 
expressed as follows: 

- * 2 ~ 6 X ° {yo + 4y, + y 2 ), X * ~ * 2 {y 2 + 4y 3 + y 4 ), . . . 

X 2n — *2»-2 / , a i 

• • • . 1 b>2n- 2 + 4^2»- 1 + y2n), 

and in accordance with the above reasoning their sum gives an 

b 

approximate value of the integral jf(x)dx. Before, putting down 

a 

this sum note that the difference between the two x's and the 
neighbouring even numbers is : 



_ b ~ a 
x 2 — x — x 4 — x 2 — . . . — x 2 „ — x 2n _ 2 — . 

n 

Consequently, putting the common factor — - — outside the 

on 



brackets we obtain 

» — 
1m 



]f(x)dx x —z~\.(y + 4y. + y 2 ) + 



+ (yi + 4y 3 + y*) + ■ ■ ■ + (y 2 n-2 + 4y 2 „- 1 + y 2 »)l 
i. e. we finally obtain the expression 

+ 2(y 2 + y A + ... + y 2 „_ 2 ) + 4(y, + y 3 + ... + y 2 „-i)]. (6) 

This is Simpson's formula in its general form. The extreme ordinates 
in square brackets are taken with the coefficient 1, all the other 
ordinates with even indices are taken with the coefficient 2, 
and those with odd indices with the coefficient 4. 
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Formula (4) in the preceding article can be considered as a 
special case of Simpson's formula, when we use only one parabolic 
trapezoid, i. e. n = 1. We see, for instance, that it gives In 2 
with an error of the order of 0.001. Let us make sure that 
for n = 5 Simpson's formula allows us to calculate In 2 with an 
error of the order of 0.0000001. Thus, let us make use of 

2 

Simpson's formula (6) to calculate — taking n = 5. Here a = 1, 

i 

b = 2, fix) = — ; for n = 5 we obtain x = 1, x 2 = 1.2, x 4 = 1.4, 

x 
x 6 = 1.6, x 8 = 1.8, x 10 = 2.0, X! = 14, x 3 = 1.3, x 5 = 1.5, x 7 = 
= 1.7, x 9 = 1.9. We calculate the values of ordinates to seven 
decimal points (with an accuracy to within 0.00000005), and at 
once compile the necessary sums to substitute them into 

Simpson's formula. We receive: y = = 1.0000000, y l0 = 

x o 

= —=0.5000000, y { + y i0 = 1.5000000; y 2 = — = 0.8333333, 

Xio x 2 

y 4 = —=0.7142857, y 6 = — = 0.6250000, y s = — = 0.5555556, 
X4 x 6 x g 

2(y 2 + ?4 + y 6 + y*) = 5.4563492; y, = — = 0.9090909, y 3 = 

= — = 0.7692308, y 5 = — = 0.6666667, y 7 = — = 0.5882353, 
X3 x 5 x 7 

y 9 = =0.5263158, 4(y i + y 3 + y 5 + y 7 + y 9 ) = 13.8381580. 

x 9 

Therefore, formula (6) gives for In 2 (n = 5) 

2 

" dx 1 

%t-f( 1.5000000 + 5.4563492 +13.838 1580) = 0.693150. 

x 6-5 



In 2 = 



But using formula (*) on p. 43 we can compute In 2 with any 
degree of accuracy; we have only to assume Jtc = 1, as was done 
on p. 43, and take n to be sufficiently large. Thus we can make 
sure that the value of In 2 corrected to eight decimal points is 
0.69314718. Consequently, the value of In 2 obtained from Simpson's 
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formula differs from the true value by a number of the order 
of 0.000003, that is, the validity of this formula in this case is 
very great. 

A more complicated analysis can demonstrate that Simpson's 
formula can yield a high degree of accuracy even for small n, 
when the graph of the function is very smooth and flat. The accuracy 
decreases when the graph contains very steep sections. 

4. Let us apply Simpson's formula to calculate approximately 
the area of a circle. Since it is proportionate to the square of 
the radius, it is sufficient to carry out the calculations for a circle 
of radius equal to 1. Then, as we know, the area will be equal to 
7i-l 2 =rc. Hence our problem is to calculate approximately the 
number n using Simpson's formula. Using the property of the 
symmetry of a circle, we reduce the calculations to those of a quarter 
of the circle (Fig. 40). Then the result will be an approximate value 



of number 



4' 



In this specific case we cannot expect a high degree of accuracy, 
although we make calculations for n = 8, for the reason of the very 
great steepness of the right side of the graph. Below we shall show 
what should be done in this case to improve the result. But 
now we shall begin the calculations. Since y is expressed in teims of x 



by the formula y = ]/\ - x 2 (Fig. 40), the problem reduces to 




computing the integral J j/l - x 2 dx by Simpson's formula. We shall 



assume n = 8. The abscissas of the point of division with even 
numbers will then pass through — , and those with odd numbers 

o 
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will differ from the preceding points that are nearest to them by 

1 113 

— . We obtain: x = 0, x 16 = 1, *2=-g' x *=-j, X6= ~$' 

1 _ 5 3 7 1 3 

•*8 — ~y> ^10 — ~g~> *12 — ~~T> *14 — jF' Xl — T/T' Xi ~ ~wT' x$ ~ 

5 7 9 11 13 15 

= IT * 7 ~ 16" ' * 9 " "16"' *" ~ T6"' Xl3 ~ 16"' Xl5 " 16" 
Let us calculate the corresponding ordinates by the formula 
y = J/1— x 2 and complete the sums which are to be substituted 

into Simpson's formula. We receive: v = 1, y 16 = 0.0000, y + 
+ y 16 = 1.0000, y 2 = 0.9922, y 4 = 0.9682, y 6 = 0.9270, y 8 = 0.8660, 
y 10 = 0.7806, y l2 = 0.6614, y 14 = 0.4841; 2(y 2 + y 4 + y 6 + y e + 
+ yio + yn + yu) = 1 1-3590; y, = 0.9980, y 3 = 0.9823, y 5 = 0.9499, 
y 7 = 0.8992, y 9 = 0.8268, y u = 0.7262, y u = 0.5830, y 15 = 0.3480; 
4(yi +y3 + ys + y7 + y9 + yn + yn + yis) = 25.2536. It follows 
that 

-1 = \]/{-x 1 dx % ^-(1.0000 + 11.3590 + 25.2536) = 0.7836. 



When n is computed by other means with an accuracy to within 

0.00005, the result is 3.1416, whence ^- = 0.7854. Hence the 

result obtained by Simpson's formula contains an error of the 

order of 0.002; it should be rounded off to 0.001: ^- ~ 0.784. 

4 
Now we shall use Simpson's formula with the same end in view 

(calculation of n) but in a more favourable situation. We shall 

move some distance from the steep right end of the graph and 

0-5 

consider the integral J j/l — x 2 dx. If we take this integral as 

o 
yielding the area of the curvilinear trapezoid AOCD (Fig. 41) 
and subtract from it the area of the triangle OCD equal to 

1 1 1/5 

= 0.2165064, we obtain the area of the circular sector 

2 2 2 

360° 
AOD with the central angle 30° = . Thus the difference 
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0.5 

J ]/\ -x z dx- 0.2165064 yields the value of one-twelfth of the 

o 

» jt 

area of the circle, i. e. — > Let us use Simpson's formula for 

0.5 

n = 4 to compute J J/1 — x 2 dx; in this case the abscissas of the 
o 

division points x , x l5 x 2 , x 3 , x 4 , x 5 , x 6 , x n and x 8 will 
remain the same as before. But we shall now calculate the 
corresponding ordinates to seven decimal points to attain a high 
degree of accuracy in the result. In this way we receive the following 
values for the ordinates and their sums contained in Simpson's 
formula: y = 1.0000000, y s = 0.8660254, y + y 8 = 1.8660254; v 2 = 
= 0.9921567, y* = 0.9682458, y 6 = 0.9270248; 2(y 2 + y 4 + y 6 ) = 
= 5.7748546; y t = 0.9980450, y 3 = 0.9822646, y 5 = 0.9499178, y 7 = 
= 0.8992184, 4(y t +y 3 + y s + y 7 ) = 15.3177832. Substituting these 




values of the sums of the ordinates into Simpson's formula we 
obtain 



0.5 

I 
O 



0.5 
6-4' 



J ]/\-x 2 dx * ^-(1.8660254 + 5.7748546 + 15.3177832) = 

= 0.4783055. 



In accordance with the above reasoning it follows that 

0.5 



-jj= ] j/l-x 2 dx-0. 



2165064*0.2617991. 
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The accuracy of the result obtained can be verified by multiplying 
it by 12 (thus increasing the error of the result the same 
number of times); this gives 3.1415892 as the value of the number n. 
But with an accuracy to within 0.0000005 the number Jt is equal to 
3.141593 (this can be found by Simpson's formula for greater 
values of n; there exist, however, other computing techniques 
requiring less laborius calculations). Consequently, giving the result 
to only five decimal points we have n « 3.14159, with an 
accuracy of within 0.000005. This very close approximation to the 
famous number n was obtained by a skilful application of 
Simpson's formula. 
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